System and method of retrieving and presenting partial (skipped) document content

ABSTRACT

A method and mobile communications device for retrieving and presenting partial skipped content are provided. A display of a mobile communication device provides content of a first segment and a further segment of an electronic document received at the mobile communication device. Content between the first segment and the further segment is skipped content which has not been received. On the display, an indication of the skipped content is provided, the indication including a selectable client control for retrieving at least a portion of the skipped content.

CROSS-REFERENCE

This application is a continuation of U.S. application Ser. No.12/036,650, filed Feb. 25, 2008 and issued as U.S. Pat. No. 8,069,410,which is a continuation of U.S. application Ser. No. 10/988,282, filedNov. 12, 2004, issued as U.S. Pat. No. 7,363,582, which claims benefitof U.S. Provisional Application No. 60/519,973, entitled “SYSTEM ANDMETHOD OF RETRIEVING AND PRESENTING PARTIAL (SKIPPED) DOCUMENT CONTENT”filed Nov. 14, 2003, the disclosures of which are incorporated herein byreference in their entireties.

BACKGROUND

This application relates generally to content delivery of documents, andspecifically to retrieving partial document content to be displayed on alimited memory and/or bandwidth device and indicating the skippedcontent from the original document to the user.

Electronic documents are produced using various computer programs, suchas word processors, spreadsheet programs, financial software, andpresentation software. In addition to textual information, documents cancontain navigational information in the form of table of content,bookmarks and hyperlinks. This navigational information allows the userof the computer programs the ability to quickly move to relevantsections of the document by clicking on a navigational link as definedby a table of content or hyperlink pointing to a bookmark. By using anavigational link the computer program will immediately reposition thecontent in the document to the desired section of the document andthereby eliminate the need for the user to traverse the document on asequential page-by-page basis until the desired section of content isfound.

When the user of a mobile communication device, or other limited memoryand/or bandwidth device, wishes to access an electronic document whichresides on a remote computer, and view the document on the mobilecommunication device, the entire electronic document is typicallytransmitted over a potentially bandwidth-constrained wireless network tothe mobile communication device. For example, if a user wishes to viewonly the one-page terms and conditions section at page 100 of a 400-pagedocument, the preceding 99 pages must first be transmitted to thedevice, which is costly and time consuming from a network limitationperspective. Yet, the user only views one page on the mobile device(i.e. page 100).

Once on the device, the electronic document is viewed using the mobilecommunication device's user interface, which typically differs from theuser interface used to create and view the document on a personalcomputer. For example, while the user interface on a personal computerused to create an electronic document may include a large, color displayand a pointing device such as a mouse, the mobile communication devicemay have a small, non-color screen, and may not have a mouse. Inaddition, the mobile communication device typically has greaterprocessing power and memory limitations than a computer used to view theelectronic document, which may be very large in size.

SUMMARY

Systems and methods of retrieving and presenting partial (skipped)document content are provided. A typical system includes a documentparsing stage, a device request for content stage and a delivery ofrequested content stage. A target device can include a devicepresentation of the requested content stage.

The system parses the content of an electronic document and generates adocument object model (DOM) representation of the electronic document.For example, where the electronic document contains navigationalelements such as table of content, hyperlinks and bookmarks, thenavigational information such as source and target destination isrecorded in the document object model (DOM), where each source anddestination target is assigned a document object model identifier (DOMID). In some instances, upon completion of parsing, the document objectmodel is then paginated in computer memory into segments of a certainsize called chunks; in such instances the pagination operation canupdate each DOM ID with an additional information attribute containingthe chunk index with the purpose of establishing a relationship betweena document object model identifier and the corresponding chunk index.Thus, when a device requests specific content corresponding to the DOMID the chunk index value indicates the correct chunk to be returned tothe device.

A method of requesting document content from a device is alsorepresented. In some instances, the method of requesting partial contentinitiated from the device can be based on navigational informationrepresented by table of content, hyperlinks and bookmarks originatingfrom the original document content facilitating the ability to skipcontent between the source and target link and thus facilitating thenear desktop experience of navigating to a different part of thedocument without navigating through the document on a sequential page bypage basis. The wireless device initiates a document viewing experienceby requesting, parsing and displaying an initial chunk of documentcontent as delivered by the corresponding server element. The deviceside component can, upon parsing the data chunk, record navigationalinformation provided in the form of link information containingcorresponding DOM ID and Chunk Index information. The link informationcan in certain instances be presented visually by a dotted underline,although alternative presentations would be feasible.

Additional advantages will be set forth in part in the description whichfollows, and in part will be obvious from the description, or may belearned by practice. The advantages will be realized and attained bymeans of the elements and combinations particularly pointed out herein.It is to be understood that both the foregoing general description andthe following detailed description are exemplary and explanatory onlyand are not restrictive.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute apart of this specification, illustrate embodiments set forth in thedescription.

FIG. 1 is a diagram depicting a typical transmission path for a documentconversion request.

FIG. 2 is a block diagram of architectural components in a typicalserver side document analysis and pagination system.

FIG. 3 is a flow chart depicting a document navigational pointsidentification process.

FIG. 4 is a flow chart depicting a pagination process.

FIG. 5 depicts an exemplary navigation screen displayed on the viewingdevice created via a server side document analysis and pagination.

FIG. 6 depicts an example interface for navigating via the navigationalscreen of FIG. 5.

FIG. 7 is an example screen showing a portion of document navigated tovia the navigational screen with an indication as to content skippedthrough the navigational process.

FIG. 8 is an example screen depicting further navigation from theexample seen in FIG. 7.

DETAILED DESCRIPTION

Various examples are now described in detail. Referring to the drawings,like numbers indicate like parts throughout the views. As used in thedescription herein, the meaning of “a,” “an,” and “the” includes pluralreference unless the context clearly dictates otherwise. Also, as usedin the description herein, the meaning of “in” includes “in” and “on”unless the context clearly dictates otherwise. Finally, as used in thedescription herein, the meanings of “and” and “or” include both theconjunctive and disjunctive and may be used interchangeably unless thecontext clearly dictates otherwise.

The foregoing description makes reference to a specific type of documentfor distribution to a potential low bandwidth and/or memory constraineddevice, namely an e-mail attachment. The present application however isnot so limited; rather, the disclosed systems and methods can be appliedto any type of distribution not simply attachments to e-mails.

Email attachments typically are in the form of documents created in wordprocessing, spreadsheets, or presentation applications. Some of thesedocument types such as word processing, presentations and also web pagessupport navigational features such as table of content, hyperlinks andbookmarks. The purpose of navigational features such as these is toallow the user of an application to quickly navigate between differentparts or sections of a document. This is useful from a user perspectiveas such documents typically can consist of large amounts of textualcontent resulting in a document span several hundred pages or more.Using navigational features such as table of content, hyperlinks andbookmarks dispenses with the requirement to manually navigate thedocument by traversing it page-by-page until a desired section of thedocument is located. A simple click on a hyperlink or table of contententry will reposition the application content display to the desiredsection, thus eliminating manual page-by-page navigation or scrollingthrough the document.

Furthermore, a table of content entry or hyperlink consists of what isknown as source definition and a target destination. The latter istypically known as a bookmark. These definitions exists as detectableentities in the original source document that the originating sourceapplication utilizes for navigational purposes. As an example, if a userwishes to see terms and conditions on page 400 of 450 in a legaldocument then clicking on the “Terms and Conditions” entry in the tableof content or the equivalent hyperlink will immediately position theuser at page 400 when viewing the document in the originating sourceapplication.

Many types of computing devices have transmission bandwidth and/ormemory constraints. The foregoing description uses a wireless device anexample of such a device; other examples include, without limitation,pocket computers, PDAs, mobile phones, etc. It should be understood thatreference to wireless devices in the foregoing does not limit theapplicability of the disclosed systems and methods with respect to otherconstrained devices.

Wireless devices today are limited compared to desktop computers when itcomes to aspects such as display real estate, processing power (CPU),storage, available power supply (battery) and network capability in theform of available bandwidth for transfer of data.

Referring to the “Terms and Conditions” example presented above, in awireless device scenario, retrieval of the content of the preceding 399pages of the original document would be required over today's limitednetwork bandwidth. This would also consume a large amount of the limitedstorage on the device itself, making this a slow and costly operation.

Wireless devices today are severely limited compared to desktopcomputers when it comes to aspects such as display real estate,processing power (CPU), storage, available power supply (battery) andnetwork capability in the form of available bandwidth for transfer ofdata.

The implementation of systems and methods to retrieve partial (skipped)content based on the navigational capabilities of the original documentssuch as table of content, hyperlinks and bookmarks is highly desirablefor a user on a wireless device. Retrieving only the desired content forreading or viewing on the device by recognizing these navigationalentities in the document and presenting them accordingly on the wirelessdevice saves time and preserves bandwidth and local storage.

Retrieving and viewing of partial (skipped) content on a wireless devicecan be accomplished through the use of two (2) elements. A clientapplication in the form of document viewer, such as an attachment viewer(AV), and a server application in the form of document server, such asan Attachment Server (AS), that through a series of interactive stepspresent the user with the navigational capabilities on a wirelessdevice, having capability similar in scope to those found in theoriginal document authoring environment.

On the document server, the server converts the document fordistribution. This conversion can be the result of a client request orcan occur automatically upon receipt of the document for distribution. Aportion of the converted document is distributed to the document viewerclient, where the converted document contains navigational pointsinterpretable by the document viewer client. Upon activation of such anavigational point, the server receives a request for a further portionof the converted document and distributes the further portion to therequesting client.

On the client side, the client can in some instances request initialconversion of the document for distribution; for example, FIG. 1 depictsa user of a wireless device (1 a) using the document viewer clientapplication to send a request to view a document attachment in an email,over the wireless network (1 b) through the internet (1 c) using theappropriate protocol to the document server (1 d). Alternatively, thedocument can be available in converted form at the point of initialrequest. In either case, a portion of the converted document is receivedand displayed on the wireless device via the document viewer client. Thedocument viewer client receives input information from the user of theclient. Based upon the received input the document viewer clientrequests one or more appropriate further portions of the converteddocuments. Upon receipt of such a portion, the document viewer clientdisplays the received further portion for further user interaction.

As discussed above, the server receives a request from the client forconversion of the original document such as an attachment in an emailmessage. Alternatively, the server begins conversion upon receipt of thedocument for distribution. FIG. 2 depicts a block diagram of conversionarchitectural components used in an exemplary server. FIG. 3 and FIG. 4depict exemplary processes to accomplish various tasks in theconversion. The server parses through the document using a documentdistiller module (2 a) and extracts defined informational entities fromthe file such as textual paragraphs, font type, font size, font styles,table of content, bookmarks, and hyperlinks etc. and populates adocument object model (DOM) module (2 b) in memory by adding validdocument entities (3 a). The parsing may include a binary examination ofthe document. This process may be assisted by reference to specificparsing rules according to a type associated with the given document. Ofthese informational entities; table of content, hyperlinks and bookmarksare recognized as navigational entities. When navigational entities aredetected (3 b) additional information such as source and targetdestination is recorded (3 c) in the document object model (DOM) whereeach source and destination target is assigned a document object modelidentifier (DOM ID) (3 d).

Upon completion of parsing the original document, the DOM can then bepaginated in a subsequent step with a pagination decorator module (2 c)in computer memory into segments of a certain size called chunks. Thepagination can occur at the time of conversion. Or, in instances whereparsing occurs automatically upon receipt of the document, paginationmay be deferred until a request for the document is received; in suchcases, the pagination can be made dependant upon a characteristic of therequest such as target device, requesting client application, etc.

The pagination starts with a chunk index value of 0 (4 a). Thepagination operation traverses the DOM by getting each DOM element (4 b)and if the DOM element is a navigational DOM ID (4C) it updates the DOMID with an additional information attribute containing the chunk index(4D) with the purpose of establishing a relationship between a documentobject model identifier and the corresponding chunk index. Thispagination operation that traverses the DOM element increases the chunkindex count every time the segment size exceeded n bytes (4 e).

Thus, when the device requests specific content corresponding to the DOMID the chunk index value indicates the correct chunk to be returned tothe device. The chunk index represents a pointer to the position in theDOM where the target content of a navigational link resides for thismethod. Consequently, this represents an exemplary approach to managingpointer based retrieval of document chunks; alternative addressingapproaches such as using other forms of unique descriptors or byte countoffsets could be used.

Upon completion of pagination by the server, the server generates outputdata according to a suitable transmission protocol for delivery to thetarget device. Such output data can, for example, be in a binary form tobe returned over the wireless network to the device. One such possibleoutput data format is UCS (Universal Content Stream) which is deviceagnostic and thus can be understood by wireless clients on differenthandheld devices. The UCS data is segmented into pieces of n bytes eachsuch as n=3000 and one such segment is retuned per client request. TheUCS is created by the server invoking a process that starts with theroot node of the DOM and traverses through the DOM elements one by oneand recording the DOM element content into equivalent UCS commands thatallows the original content to be repurposed by the client whilepreserving essential document characteristics such as font size, fontstyles, paragraphs, table of content, hyperlinks and bookmarks.

When the client requests a converted document from the server, theclient enters into a pending state until the server returns a response.In the scenario of a successful conversion the client is notified tothis effect with a status code and the first chunk of the converteddocument (e.g., n bytes of binary UCS data). The client then parses thedata and executes any command recorded therein to repurpose the originaldocument content in the client display area. The client presents textualcontent using font characteristics such as font styles and paragraphs,and also display navigational entities such as table of content,hyperlinks and bookmarks in an equivalent form to the originalapplication. Navigational entities are typically present as links, whichare displayed by the client in a suitable form such as link textunderlined by a dotted underline such as “This is a link”, or a similardistinct indication (FIG. 5), and can in some instances furthermoreprovide additional usability guidance by highlighting links while theuser navigates by moving the client cursor through the presentedcontent. The received chunk contains the matching DOM ID and chunk indexassociated with each such presented navigational entity.

A navigational link encountered in the client may be invoked such as byselecting the command “Retrieve” (FIG. 6) from the client menu. Theclient then sends a request to the server using the appropriatecommunication protocol and passes information about the link source anddestination characteristics (e.g. the DOM ID and chunk index are sent tothe server asking for content from the DOM at the target destination).

When the server receives a request for content from the client, thecommand sent by the client passes information associated with anavigational link in the form of DOM ID and chunk index, as discussedabove. The server uses the chunk index to locate the appropriateposition in the DOM in memory where the target content of thenavigational link points.

The second step to complete the request received from the client is topersist the content from the pointed position in the DOM into outputdata (e.g., the binary UCS format in the size of n bytes). The chunk isthen returned over the wireless network to the requesting client.

The client receives the new chunk over the wireless network and parsesthrough the commands therein. The client detects that this is theresponse to its preceding client request through the chunk indexinformation encountered by parsing the output data byte stream. Theclient then determined whether there is skipped content between whatalready resides on the device by looking at the chunk index informationon the device and the recently received chunk index. If a gap isdetected (e.g. a scenario such as chunk index 1 is on the device and anavigational link was invoked for chunk index 10) then there is skippedcontent remaining on the server (i.e. from chunk index 2-9).

To allow for a visual indication to the user of the client application,the display of the client application is then updated to reflect theskipped content by inserting a horizontal bar between the contentdisplayed for chunk index 1 and the content displayed for chunk index 10and labeling the horizontal bar with the text “Skipped Content” and thesize of the skipped content remaining at the server, in kilobytes (FIG.7).

If the client application issues multiple navigational link requests,upon receipt of the server response for the requests multiple horizontal“skipped content” bars are inserted where appropriate if there are gapsin the chunk index data on the device.

Furthermore, the horizontal inserted “Skipped Content” bar indicatorserves the purpose as acting as a selectable client control to allowinvocation of the client command “more”. The “More” command allows theclient application to request the succeeding chunk index from theserver. This means if the client application invokes “more” (FIG. 8)from the skipped content bar control inserted between chunks 1 and 10 onthe device, a request for chunk index 2 is sent to the server. Uponreceipt of the data chunk the client display is updated with theinserted new content and the “skipped content” bar is also updated toreflect the decrease in Kilobytes of the remaining skipped content backat the server.

The various systems and methods described above are given asillustrative examples only. It will be readily appreciated by thoseskilled in the art that many deviations may be made from the specificsystems and methods disclosed in this application.

The invention claimed is:
 1. A method comprising: displaying on adisplay of a mobile communication device content of a first segment ofan electronic document and a further segment of the electronic documentreceived at the mobile communication device, content of an electronicdocument between the first segment and the further segment being skippedcontent not received, wherein each of the first segment, the skippedcontent and the further segment comprise text of the electronicdocument, and consecutively numbered chunks of the text of theelectronic document, the first segment, the skipped content and thefurther segment each comprising respective segments of a same electronicdocument, the skipped content representing a gap in chunk indexinformation in the consecutively numbered chunks of the text; detectingwithin said mobile communication device the skipped content between thefirst segment and the further segment by processing the chunk indexinformation on the mobile communication device and detecting the gaptherein; providing on the display a horizontal bar indicator of theskipped content between the first segment and the further segment, thehorizontal bar indicator including a selectable client control forretrieving at least a portion of the skipped content at the mobilecommunication device; calculating and then displaying size of theskipped content within the horizontal bar indicator, the size of theskipped content provided in memory units to indicate an amount ofbandwidth that would be used if the skipped content were received at themobile communication device, and, when there are a plurality of gaps inthe chunk index information, providing, on the display device, aplurality of horizontal bar indicators of skipped content, betweenrespective received segments of the electronic document, each of theplurality of horizontal bar indicators corresponding to respective gapsin the chunk index.
 2. The method of claim 1, wherein the retrieving atleast a portion of the skipped content comprised invoking an additionalrequest for an additional segment of the electronic document.
 3. Themethod claim 1, wherein the first segment and the further segment arereceived from a server, and the at least a portion of the skippedcontent is retrieved from the server.
 4. The method of claim 1, whereinat least one of the first segment and further segment comprise commandscontaining content and document characteristics, and the first segmentand the further segment are displayed according to the documentcharacteristics.
 5. The method of claim 4, wherein the further segmentis received in response to user selection of a navigational entitywithin the first segment.
 6. The method of claim 5, wherein thenavigational entity comprises at least one of a table of content,hyperlinks, bookmarks, and underlined link text.
 7. The method of claim4, wherein the document characteristics include at least one of textualparagraphs, font type, font size, font style, table of contents,bookmarks, and hyperlinks.
 8. The method of claim 1, wherein the firstsegment and the further segment are received in a Universal ContentStream (UCS) format.
 9. The method of claim 1, wherein the first segmentand the further segment are received from a server upon transmission, tothe server, of a DOM ID (document object model identification) togetherwith the chunk index information.
 10. A mobile communication devicecomprising: a display device and a processor configured to: display, atthe display device, content of a first segment of an electronic documentand a further segment of the electronic document received at the mobilecommunication device, content of an electronic document between thefirst segment and the further segment being skipped content notreceived, wherein each of the first segment, the skipped content and thefurther segment comprise text of the electronic document, andconsecutively numbered chunks of the text of the electronic document,the first segment, the skipped content and the further segment eachcomprising respective segments of a same electronic document, theskipped content representing a gap in chunk index information in theconsecutively numbered chunks of the text; detect within said mobilecommunication device the skipped content between the first segment andthe further segment by processing the chunk index information on themobile communication device and detecting a gap therein; provide, on thedisplay device, an horizontal bar indicator of the skipped contentbetween the first segment and the further segment, the horizontal barindicator including a selectable client control for retrieving at leasta portion of the skipped content at the mobile communication device;calculate and then display size of the skipped content within thehorizontal bar indicator, the size of the skipped content provided inmemory units to indicate an amount of bandwidth that would be used ifthe skipped content were received at the mobile communication device,and, when there are a plurality of gaps in the chunk index information,provide, on the display device, a plurality of horizontal bar indicatorsof skipped content, between respective received segments of theelectronic document, each of the plurality of horizontal bar indicatorscorresponding to respective gaps in the chunk index.
 11. The mobilecommunication device of claim 10, wherein the retrieving at least aportion of the skipped content comprised invoking a request for anadditional segment of the electronic document.
 12. The mobilecommunication device claim 10, wherein the first segment and the furthersegment are received from a server, and the at least a portion of theskipped content is retrieved from the server.
 13. The mobilecommunication device of claim 10, wherein at least one of the firstsegment and further segment comprise commands containing content anddocument characteristics, and the first segment and the further segmentare displayed according to the document characteristics.
 14. The mobilecommunication device of claim 13, wherein the further segment isreceived in response to user selection of a navigational entity withinthe first segment.
 15. The mobile communication device of claim 14,wherein the navigational entity comprises at least one of a table ofcontent, hyperlinks, bookmarks, and underlined link text.
 16. The mobilecommunication device of claim 13, wherein the document characteristicsinclude at least one of textual paragraphs, font type, font size, fontstyle, table of contents, bookmarks, and hyperlinks.
 17. The mobilecommunication device of claim 10, wherein the first segment and thefurther segment are received in a Universal Content Stream (UCS) format.18. The mobile communication device of claim 10, wherein the firstsegment and the further segment are received from a server upontransmission, to the server, of a DOM ID (document object modelidentification) together with the chunk index information.
 19. Anon-transitory computer-readable medium storing a computer program,wherein execution of the computer program is for: displaying on adisplay of a mobile communication device content of a first segment ofan electronic document and a further segment of the electronic documentreceived at the mobile communication device, content of an electronicdocument between the first segment and the further segment being skippedcontent not received, wherein each of the first segment, the skippedcontent and the further segment comprise text of the electronicdocument, and consecutively numbered chunks of the text of theelectronic document, the first segment, the skipped content and thefurther segment each comprising respective segments of a same electronicdocument, the skipped content representing a gap in chunk indexinformation in the consecutively numbered chunks of the text; detectingwithin said mobile communication device the skipped content between thefirst segment and the further segment by processing the chunk indexinformation on the mobile communication device and detecting the gaptherein; providing on the display a horizontal bar indicator of theskipped content between the first segment and the further segment, thehorizontal bar indicator including a selectable client control forretrieving at least a portion of the skipped content at the mobilecommunication device; calculating and then displaying size of theskipped content within the horizontal bar indicator, the size of theskipped content provided in memory units to indicate an amount ofbandwidth that would be used if the skipped content were received at themobile communication device, and, when there are a plurality of gaps inthe chunk index information, providing, on the display device, aplurality of horizontal bar indicators of skipped content, betweenrespective received segments of the electronic document, each of theplurality of horizontal bar indicators corresponding to respective gapsin the chunk index.